Multiple instance learning of Calmodulin binding sites

نویسندگان

  • Fayyaz ul Amir Afsar Minhas
  • Asa Ben-Hur
چکیده

MOTIVATION Calmodulin (CaM) is a ubiquitously conserved protein that acts as a calcium sensor, and interacts with a large number of proteins. Detection of CaM binding proteins and their interaction sites experimentally requires a significant effort, so accurate methods for their prediction are important. RESULTS We present a novel algorithm (MI-1 SVM) for binding site prediction and evaluate its performance on a set of CaM-binding proteins extracted from the Calmodulin Target Database. Our approach directly models the problem of binding site prediction as a large-margin classification problem, and is able to take into account uncertainty in binding site location. We show that the proposed algorithm performs better than the standard SVM formulation, and illustrate its ability to recover known CaM binding motifs. A highly accurate cascaded classification approach using the proposed binding site prediction method to predict CaM binding proteins in Arabidopsis thaliana is also presented. AVAILABILITY Matlab code for training MI-1 SVM and the cascaded classification approach is available on request. CONTACT [email protected] or [email protected].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

pyLEMMINGS: Large Margin Multiple Instance Classification and Ranking for Bioinformatics Applications

Motivation: A major challenge in the development of machine learning based methods in computational biology is that data may not be accurately labeled due to the time and resources required for experimentally annotating properties of proteins and DNA sequences. Standard supervised learning algorithms assume accurate instancelevel labeling of training data. Multiple instance learning is a paradi...

متن کامل

MBSTAR: multiple instance learning for predicting specific functional binding sites in microRNA targets

MicroRNA (miRNA) regulates gene expression by binding to specific sites in the 3'untranslated regions of its target genes. Machine learning based miRNA target prediction algorithms first extract a set of features from potential binding sites (PBSs) in the mRNA and then train a classifier to distinguish targets from non-targets. However, they do not consider whether the PBSs are functional or no...

متن کامل

Interactions of calmodulin with the multiple binding sites of Cav1.2 Ca2+ channels.

Although calmodulin binding to various sites of the Cav1.2 Ca(2+) channel has been reported, the mechanism of the interaction is not fully understood. In this study we examined calmodulin binding to fragment channel peptides using a semi-quantitative pull-down assay. Calmodulin bound to the peptides with decreasing affinity order: IQ > preIQ > I-II loop > N-terminal peptide. A peptide containin...

متن کامل

Computational comparison of a calcium-dependent jellyfish protein (apoaequorin) and calmodulin-cholesterol in short-term memory maintenance

Memory reconsolidation and maintenance depend on calcium channels and on calcium/calmodulin-dependent kinases regulating protein turnover in the hippocampus. Ingestion of a jellyfish protein, apoaequorin, reportedly protects and/or improves verbal learning in adults and is currently widely advertised for use by the elderly. Apoaequorin is a member of the EF-hand calcium binding family of protei...

متن کامل

Different Learning Levels in Multiple-choice and Essay Tests: Immediate and Delayed Retention

    This study investigated the effects of different learning levels, including Remember an Instance (RI), Remember a Generality (RG), and Use a Generality (UG) in multiple-choice and essay tests on immediate and delayed retention. Three-hundred pre-intermediate students participated in the study. Reading passages with multiple-choice and essay questions in different levels of learning were giv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 28  شماره 

صفحات  -

تاریخ انتشار 2012